PSNet: Parallel Symmetric Network for Video Salient Object Detection

نویسندگان

چکیده

For the video salient object detection (VSOD) task, how to excavate information from appearance modality and motion has always been a topic of great concern. The two-stream structure, including an RGB stream optical flow stream, widely used as typical pipeline for VSOD tasks, but existing methods usually only use features unidirectionally guide or adaptively blindly fuse two features. However, these underperform in diverse scenarios due uncomprehensive unspecific learning schemes. In this paper, following more secure modeling philosophy, we deeply investigate importance comprehensive way propose network with up down parallel symmetry, named PSNet. Two branches different dominant modalities are set achieve complete saliency decoding cooperation Gather Diffusion Reinforcement (GDR) module Cross-modality Refinement Complement (CRC) module. Finally, Importance Perception Fusion (IPF) according their scenarios. Experiments on four dataset benchmarks demonstrate that our method achieves desirable competitive performance.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MSDNN: Multi-Scale Deep Neural Network for Salient Object Detection

Salient object detection is a fundamental problem and has been received a great deal of attentions in computer vision. Recently deep learning model became a powerful tool for image feature extraction. In this paper, we propose a multi-scale deep neural network (MSDNN) for salient object detection. The proposed model first extracts global high-level features and context information over the whol...

متن کامل

Efficient Co-Salient Video Object Detection Based on Preattentive Processing

Automatic video annotation is a critical step for contentbased video retrieval and browsing. Detecting the focus of interest such as co-occurring objects in video frames automatically can benefit the tedious manual labeling process. However, detecting the co-occurring objects that is visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...

متن کامل

Video Salient Object Detection Using Spatiotemporal Deep Features

This paper presents a method for detecting salient objects in videos where temporal information in addition to spatial information is fully taken into account. Following recent reports on the advantage of deep features over conventional handcrafted features, we propose the SpatioTemporal Deep (STD) feature that utilizes local and global contexts over frames. We also propose the SpatioTemporal C...

متن کامل

Impression Network for Video Object Detection

Video object detection is more challenging compared to image object detection. Previous works proved that applying object detector frame by frame is not only slow but also inaccurate. Visual clues get weakened by defocus and motion blur, causing failure on corresponding frames. Multiframe feature fusion methods proved effective in improving the accuracy, but they dramatically sacrifice the spee...

متن کامل

Salient Object Detection using a Context-Aware Refinement Network

Recently there has been remarkable success in pushing the state of the art in salient object detection. Most of the improvements are driven by employing end-to-end deeper feed-forward networks. However, in many cases precisely detecting salient regions requires representation of fine details. Combining high-level and low-level features using skip connections is a strategy that has been proposed...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE transactions on emerging topics in computational intelligence

سال: 2023

ISSN: ['2471-285X']

DOI: https://doi.org/10.1109/tetci.2022.3220250